Hybrid Hierarchical Clustering: Forming a Tree From Multiple Views
نویسندگان
چکیده
We propose an algorithm for forming a hierarchical clustering when multiple views of the data are available. Different views of the data may have different underlying distance measures which suggest different clusterings. In such cases, combining the views to get a good clustering of the data becomes a challenging task. We allow these different underlying distance measures to be arbitrary Bregman divergences (which includes squared-Euclidean and KL distance). We start by extending the average-linkage method of agglomerative hierarchical clustering (Ward’s method) to accommodate arbitrary Bregman distances. We then propose a method to combine multiple views, represented by different distance measures, into a single hierarchical clustering. For each binary split in this tree, we consider the various views (each of which suggests a clustering), and choose the one which gives the most significant reduction in cost. This method of interleaving the different views seems to work better than simply taking a linear combination of the distance measures, or concatenating the feature vectors of different views. We present some encouraging empirical results by generating such a hybrid tree for English phonemes.
منابع مشابه
Extending Consensus Clustering to Explore Multiple Clustering Views
Consensus clustering has emerged as an important extension of the classical clustering problem. Given a set of input clusterings of a given dataset, consensus clustering aims to find a single final clustering which is a better fit in some sense than the existing clusterings. There is a significant drawback in generating a single consensus clustering since different input clusterings could diffe...
متن کاملA new algorithm for hybrid hierarchical clustering with visualization and the bootstrap
We propose a hybrid clustering method, Hierarchical Ordered Partitioning And Collapsing Hybrid (HOPACH), which is a hierarchical tree of clusters. The methodology combines the strengths of both partitioning and agglomerative clustering methods. At each node, a cluster is partitioned into two or more smaller clusters with an enforced ordering of the clusters. Collapsing steps uniting the two clo...
متن کاملHybrid Hierarchical Classifiers for Hyperspectral Data Analysis
We propose a hybrid hierarchical classifier that solves multiclass problems in high dimensional space using a set of binary classifiers arranged as a tree in the space of classes. It incorporates good aspects of both the binary hierarchical classifier (BHC) and the margin tree algorithm, and is effective over a large range of (sample size, input dimensionality) values. Two aspects of the propos...
متن کاملHierarchical Data Gathering Scheme for Energy Efficient Wireless Sensor Network
Compressive Data Gathering (CDG) has recently emerged to provide energy efficient data aggregation in wireless sensor networks (WSNs). Existing CDG is done by forming a span-tree from all sensors to the data sink, which leads to a lot of energy consumption in maintain this tree. In this paper, we present a clustering based CDG scheme for largescale WSNs. This scheme reduces traditional network ...
متن کاملVisualizing Large Graphs with Compound-Fisheye Views and Treemaps
Compound-fisheye views are introduced as a method for the display and interaction with large graphs. The method relies on a hierarchical clustering of the graph, and a generalization of the traditional fisheye view, together with a treemap representation of the cluster tree.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005